An Effective Fuzzy Clustering of Crime Reports Embedded by a Universal Sentence Encoder Model

نویسندگان

چکیده

Crime reports clustering is crucial for identifying and preventing criminal activities that frequently happened in society. In the proposed work, named entities a report are recognized to extract crime-related phrases subsequently, preprocessed by applying stopword removal lemmatization operations. Next, module of universal encoder model, called transformer, applied get sentence embedding each associated sentence, aggregation which finally provides vector representation report. An innovative efficient graph-based algorithm consisting splitting merging operations has been cluster crime reports. The generates overlapping clusters, indicates existence multiple types. fuzzy theory used provide score expressing its membership into different accordingly, labelled categories. efficiency method assessed taking account datasets comparing them with other state-of-the-art approaches help various performance measure metrics.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Universal Sentence Encoder

We present models for encoding sentences into embedding vectors that specifically target transfer learning to other NLP tasks. The models are efficient and result in accurate performance on diverse transfer tasks. Two variants of the encoding models allow for trade-offs between accuracy and compute resources. For both variants, we investigate and report the relationship between model complexity...

متن کامل

Merging Duplicate Bug Reports by Sentence Clustering

Duplicate bug reports are often unfavorable because they tend to take many man hours for being identified as duplicates, marked so and eventually discarded. In this time, no progress occurs on the program in question, and is justifiably an overhead which should be minimized. Considerable research has been carried out to alleviate this problem. Many methods have been proposed for bug report cate...

متن کامل

OPTIMIZATION OF FUZZY CLUSTERING CRITERIA BY A HYBRID PSO AND FUZZY C-MEANS CLUSTERING ALGORITHM

This paper presents an efficient hybrid method, namely fuzzy particleswarm optimization (FPSO) and fuzzy c-means (FCM) algorithms, to solve the fuzzyclustering problem, especially for large sizes. When the problem becomes large, theFCM algorithm may result in uneven distribution of data, making it difficult to findan optimal solution in reasonable amount of time. The PSO algorithm does find ago...

متن کامل

Optimal Sentence Clustering Using An Innovative Hierarchical Fuzzy Clustering Algorithm

The role of data clustering is inevitable in many text processing activities .Many proceedings are going on in this area since it has wider applications. Sentence clustering is a challenging task when compared with other data clustering, because a sentence is able to represent same ideas in different ways. For E.g. some people see a glass as half empty and some others see half full. Due to this...

متن کامل

a tripartite model of efl teachers attributions, burnout, and ‎self-regulation: towards the prospects of effective teaching

همطالعه حاضر به ارائه مدلی برای آموزش موثر زبان انگلیسی ‏می پردازد. مدل حاضر از سه عامل تاثیر گذار در کارایی ‏تدریس معلمان زبان انگلیسی بهره می برد. این سه عامل شامل ‏سبکهای اسنادی، خود تنطیمی و فرسودگی شغلی معلمان ایرانی ‏زبان انگلیسی می باشد. رساله مورد نظر درچهار فاز طراحی ‏شده است: فاز اول شامل طراحی و رواسازی پرسشنامه سبکهای ‏اسنادی معلمان زبان انگلیسی و فاز دوم شامل استفاده از ‏این پرسشنا...

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Mathematics

سال: 2023

ISSN: ['2227-7390']

DOI: https://doi.org/10.3390/math11030611